Research in Astron. Astrophys. 2010 Vol. XX No. XX, 000-000 
http:/Avww. raa-joumal.org http.VAvww. iop. org/joumals/raa 



Research in 
Astronomy and 
Astrophysics 



Comparison of Halo Detection from Noisy Weak Lensing 
Convergence Maps with Gaussian Smoothing and MRLens 
Treatment 

Y.-X. Jiao, H.-Y. Shan and Z.-H. Fan 

Department of Astronomy, Peking University, Beijing 100871, Cidna; fanzuhui@pku.edu. cn 
Received [year] [month] [day]; accepted [year] [month] [day] 

Abstract Taking into account the noise from intrinsic ellipticities of source galaxies, we 
study the efficiency and completeness of halo detections from weak lensing convergence 
maps. Particularly, with numerical simulations, we compare the Gaussian filter with the 
so called MRLens treatment based on the modification of the Maximum Entropy Method. 
For a pure noise field without lensing signals, a Gaussian smoothing results a residual 
noise field that is approximately Gaussian in statistics if a large enough number of galax- 
ies are included in the smoothing window. On the other hand, the noise field after the 
MRLens treatment is significantly non-Gaussian, resulting complications in characteriz- 
ing the noise effects. Considering weak-lensing cluster detections, although the MRLens 
treatment effectively deletes false peaks arising from noise, it removes the real peaks heav- 
ily due to its inability to distinguish real signals with relatively low amplitudes from noise 
in its restoration process. The higher the noise level is, the larger the removal effects are 
for the real peaks. For a survey with a source density Ug ^ 30 arcmin^^, the number of 
peaks found in an area of 3 x 3 deg^ after MRLens filtering is only ~ 50 for the detection 
threshold k = 0.02, while the number of halos with M > 5 x 10^'^ M© and with redshift 
2; < 2 in the same area is expected to be ~ 530. For the Gaussian smoothing treatment, the 
number of detections is ~ 260, much larger than that of the MRLens. The Gaussianity of 
the noise statistics in the Gaussian smoothing case adds further advantages for this method 
to circumvent the problem of the relatively low efficiency in weak-lensing cluster detec- 
tions. Therefore, in studies aiming to construct large cluster samples from weak-lensing 
surveys, the Gaussian smoothing method performs significantly better than the MRLens. 
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1 INTRODUCTION 

The weak gravitational lensing effect provides a unique tool in measuring the matter distribution in the 
universe (e.g., Bartelmann & Schneider 2001; Hoekstra et al. 2006; Massey et al. 2007). Its additional 
dependence on the distances to the source, to the lens and between the source and lens makes it an 
excellent probe in cosmological studies of dark energy (e.g., Albrecht et al. 2006; Benjamin et al. 2007; 
Kilbinger et al. 2009; Li et al. 2009). On the other hand, however, different observational and physi- 
cal effects can affect the weak lensing analyses significantly. Being extracted from shape distortion of 
background galaxies, the weak lensing effect on individual source galaxies is severely contaminated by 
their intrinsic ellipticities. Therefore statistical analyses on a large number of galaxies are necessary in 
weak lensing studies. Even so, intrinsic shape alignments of galaxies, including intrinsic -intrinsic and 
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shear-intrinsic correlations, can be an important source of error in cosmic shear correlation analyses. For 
cluster detections from weak lensing convergence maps reconstructed from shear measurements (e.g., 
Kaiser & Squires 1993; Bartelmann 1995; Kaiser 1995; Schneider & Seitz 1995; Squires & Kaiser 1996; 
Bridle et al. 1998; Marshall et al. 2002), even randomly orientated intrinsic ellipticities can result false 
peaks by their chance alignments, which can reduce the efficiency of cluster detections significantly 
(e.g., Schneider 1996; van Waerbeke 2000; White et al. 2002; Hamana et al. 2004; Fan 2007). Thus 
further treatments for a convergence map are normally required to suppress the noise effects. 

The noise from intrinsic ellipticities of source galaxies is essentially shot noise, and thus by av- 
eraging over a relatively large number of source galaxies in weak lensing analyses, the residual noise 
can be effectively reduced. This leads to the normal smoothing treatment. It is clear that the residual 
noise depends on the form of the window function and the smoothing scale. For a Gaussian smoothing 
with a window function of the form W{9) oc exp(— the residual noise can be estimated by 
(Tg f=a ((Tf/2)[l/(27r^^ng)], where is the rms of the intrinsic ellipticity of individual source galax- 
ies, 6c is the smoothing scale, and Ug is the surface number density of source galaxies. For cr^ = 0.3, 
Ug = 30 arcmin"'^ and = 1 arcmin, we have ctq « 0.015. 

Recently, Starck et al. (2006) proposed the MRLens filtering technique, which is based on the 
Bayesian analyses with a multi-scale entropy prior applied. The False Detection Rate (FDR) method is 
used to select significant/non-significant wavelet coefficients (e.g., Starck et al. 2006; Pires et al. 2009). 
The MRLens method suppresses noise adaptively according to the strength of the noise itself. A more 
detailed description of the method is given in §4. 

In this paper, with numerical simulations, we compare Gaussian smoothing with MRLens treatment, 
paying particular attention to the completeness and the efficiency of weak lensing halo detections from 
convergence maps. The rest of the paper is organized as follows. In §2, we describe briefly the weak- 
lensing convergence reconstruction and the Gaussian smoothing. In §3, we present the important aspects 
of the MRLens treatment. Results are shown in §4. Section 5 contains sunnmaries and discussions. 

2 WEAK LENSEVG CONVERGENCE RECONSTRUCTION 

In the weak lensing regime, the convergence k{9) is essentially related to the weighted projection of 
density fluctuations i5 along the unperturbed light path. SpecificaUy, we have 

K.[e) = — ^ — / dwW{w)fK{w) ^- (1) 

2 Jq a(w) 

where Hq is the present Hubble constant, fio is the present matter density of the universe in unit of the 
critical density, w is the radial coordinate, a{iu) is the scale factor of the universe, and, with K being 
the spatial curvature of the universe, 

fK{w) = \K\-'^/^ sm{\K\^/^w) iK>0) 

= w {K = 0) 

= \K\-^/^ sin]i{\K\^/^w) {K <0) . (2) 
The factor W{w) is the weighting function that is related to the source galaxy distribution G{w) by 



W{w)= / dw'G{w') 

J W 



The lensing potential (j) is related to k by 
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(3) 



(4) 



and the shears 71 and 72 are 



71 = 7, > 72 = Ol20. (5) 
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Since both k and 7^ are determined by the lensing potential, they are mutually dependent of each 
other. In the Fourier space, we have (Kaiser & Squires 1993) 

K(fe) = Ci(A:)7i(fe) + C2(fc)72(fe), (6) 

where [ci,C2] = [cos(2(^), sin(2(^)] with k = A; (cos 0, sin 0). 

Observationally, the shear 7 can be extracted from the shape measurement of source galaxy images. 
Under the condition k << 1, we have 

e°bs ~ ^ + , (7) 

where e°^^ and e'^ are the observed elUpticity, and the intrinsic ellipticity of a source galaxy, respec- 
tively. Reconstructed from 6°'^'*, the convergence Kn{k) then contains noise from the intrinsic part, i.e., 

Kr,{k) = c,(fc)e°b«(fe) = K{k) + c„(fc)ef (fe). (8) 

With the transformation back to the real 2-D space and applying a smoothing with the window function 
W{6), we can obtain the smoothed quantities (e.g., van Waerbeke 2000) 

= r(6>) + —i:fj,w{e - 0i)e^{ei) (9) 

Hg 

and 

KM{e) = j dfce-*«c„(fc)S^^(fc), (10) 

where 1]°^^, F, and Km are the smoothed e°'^'', 7 and k„, respectively, and ng and Ng are the surface 
number density and the total number of source galaxies in the field. The noise part of Km due to the 
intrinsic ellipticities is then 

N{e) = -St^ / cifeM/(fe)e-''^ («-«')c„(fc)ef(0,), dD 
ng J 

where W{k) is the Fourier transformation of the window function with the form 



W{k) = — j dOe^^-^WiO). (12) 

Without considering the intrinsic alignment of e"^, it is expected from the central limit theorem 
that the smoothed noise field N{0) is approximately Gaussian in statistics if the effective number of 
galaxies included in the smoothing window is larger than about 10 (e.g., van Waerbeke 2000). In this 
case, smoothing leads to correlations in N{G), and its two-point correlation function is approximately 

< N{d)N{e') >= ^{27rf [ rffee''=-(^'-^)|W^(fe)|2, (13) 

^^g J 

where is the intrinsic dispersion of 6°*^*^. 

The approximate Gaussianity of N{6) allows us to quantify the noise effects straightforwardly. The 
noise effects on cluster mass reconstruction and the noise peak statistics are analyzed in van Waerbeke 
(2000). Even with weak aUgnments of intrinsic ellipticities, N{d) can still be approximately described 
by a Gaussian random field with a modified two-point correlation function including the effects of in- 
trinsic alignments. The enhancement of the noise peak abundance due to the weakly intrinsic alignments 
are analyzed in Fan (2007). In Fan et al. (2010), the effects of the presence of real dark matter halos on 
the noise peak statistics around them as well as the effects of the noise on the peak height of real halos 
are investigated in detail. They further present a model to calculate the total peak abundance in a large- 
scale convergence map, including the peaks corresponding to real halos and the noise peaks from the 
chance alignment of the intrinsic ellipticities of source galaxies. Such a model makes it possible for us to 
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use directly the peaks from convergence maps as cosmological probes without the need to differentiate 
real and false peaks. 

Due to its simple operational procedure and the Gaussian statistics of the residual noise field, the 
smoothing treatment has been widely applied in weak lensing analyses. Different smoothing functions 
have been used in different studies. In this paper, we consider the Gaussian smoothing fimction Wq, 
which is one of the most commonly adopted window functions. Specifically, we have 

^-=4 '"'("I)' '''' 

where 6g is the smoothing scale. Then from Eq. (13), the rms of the noise ctq after smoothing is given 

by 

''o = f2;^- ^^^^ 

In our analyses, we choose CTe = 0.3, the typical value for lensing source galaxies, and Oq = ^ arcmin, 
which is the optimal smoothing scale considering cluster-sized halos. Then for a lensing survey with 
Hg = 30 arcmin"^, ao ~ 0.015, which is about 20 times lower than a^. 



3 MRLENS METHOD 



Starck et al. (2006) introduce a new reconstruction and filtering method, namely. Multi-scale Entropy 
Restoration (MRLens). It is developed from the Maximum Entropy Method. The basic idea is to use 
only 'signals' selected by the so called False Discovery Rate (FDR) (Benjamini & Hochberg 1995) 
to reconstruct the convergence field through a Multi-scale Entropy prior. In the following, we present 
specific steps of MRLens. 



3.1 Wavelet decomposition 

For an original convergence map Kots with N — nx n pixels, the first step of MRLens is to decompose 
the image map into different components representing fine structures of different scales. 

To do this, we first initialize j = and set Co (A;, I) = K,obs{k, I), i.e., j = corresponds to the 
unprocessed map with detailed structures. Then we progressively go to higher j to obtain smoother 
maps through (Starck et al. 2001) 

Cj+^{k,l)=^^hiD{m)hxD{n)Cj{k + Tm,l + 2^n), (16) 

m n 

wheTehioim) = [1/16,4/16, 6/16, 4/16, 1/16] for m = -2, -1, 0, 1, 2, respectively. Defining 

wj+i{k,l) = Cj{k,l) - Cj+iik,l), (17) 

we finally obtain 

J 

Kobs{k,l) = Cj{k,l) + ^wj{k.l), (18) 

where J is a chosen number determined by specific considerations on how smooth we want to go. Here 
we set J = 7. In our following analyses, each map is 3 x 3 deg^ discretized into 1024 x 1024 pixels. Thus 
2-^ = 128 pixels corresponding to 22 arcmin. Because we do not expect to see significant structures 
resulting purely from noise on such a large scale, J = 7 is an appropriate choice. 

It can be seen from Eq. (18) that Cj{k, I) is the most smoothed version of the original map Kobs, 
and the terms in the summation contain ever smaller-scale information with smaller j. 
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3.2 Multiscale Entropy 

With the multi-scale wavelet decomposition, one can then construct an entropy with the obtained wavelet 
coefficients wj {k, I) at each grid {k, I) with j = 1, 2, J. It can generally be written as 

J 

j = l kj 

For h, there are different definitions (e.g., Starck et al. 2006). 

Here we follow Starck et al. (2001) to choose the entropy of NOISE-MSE /i„ in our considera- 
tions. At each scale j, the noise entropy at each grid (fc, I) is derived by weighting the entropy with a 
probability that Wj{k, I) is contributed by noise. Specifically, we have 

r\w3{k,i)\ dhix) 
hn[wj{k,l)] = J Pn[\wj{k,l)\-u]-^U^y,du, (20) 

where Pn[wj {k, /)] is the probability that the coefficient wj (fc, I) can be due to noise, and is given by 

Pn[wj{k,l)] = Prob[M^ >| Wj{k,l) |]. (21) 

Eq. (20) essentially regards the information contained in Wj{k, I) to be built up from the summation of 
dh{u). For each newly added dh{u), depending on the difference \wj{k, l)\ — u, there is a probability 
that it is due to noise. 

For Gaussian noise with rms Gj at scale j, we have 

2 /■+°° 

Pn[wj{k,l)\ = -^=— / exp(-WV2<^|)<^W^ 

\/27rcrj J\wj(k.l)\ 
Wj,k, 

K[w,ik,l)] = ^^'"'^^''\erfc(^ ' ""^'^^'^ I '"^ yu. (23) 

3.3 Selecting significant wavelet coefficients using the False Discovery Rate (FDR) 

The Multiscale Entropy method applies regularizations on wavelet coefficients to minimize noise con- 
tributions while keeping the signal information. Thus for those coefficients which are clearly signals, 
they should be kept unchanged. Then a new Multiscale Entropy is defined as (e.g., Starck et al. 2006) 

hn[wj{k,l)] = Mj{k,l)hn[wj{k,l)], (24) 

where 

Mj{k,l) = 1- Mj{k,l), (25) 
and M is the multi-resolution support defined as (Starck et al. 1995) 

= (26) 

■i^ ' ' if <) IS not sigmficant 

Therefore /i„ means that we only need to regularize those wavelet coefficients which are 'not significant', 
that is, they are likely due to noise. 

Forjudging the significance of a wavelet coefficient, a commonly used criterion is a 'ka' threshold. 
If a coefficient is above the threshold, it is defined to be 'significant'. This is equivalent to set a threshold 



erfcl ) (22) 



and thus 
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for the ratio of 'significant' detections over the total number of pixels being analyzed. Considering a 
Gaussian noise, a la criterion corresponds to a probability of 0.05 for a noise coefficient being mis- 
classified as 'significant'. If we have totally N pixels to consider, the number of false discoveries is 
then on average 0.05A^. If the number of pixels related to real signals in the analyses is comparable 
to 0.057V, the false discovery rate with respect to the number of real signals can be much higher than 
0.05. Increasing k can lower the number of false detections at the expense, however, of the power of 
real detections. To overcome such difficulties, an altemative thresholding technique, the False Discovery 
Rate (FDR), has been proposed (Benjamini & Hochberg 1995; Miller et al. 2001; Hopkins et al. 2002; 
Starck et al. 2006). 

This method can effectively control, in an adaptive manner, the fraction of false discoveries over the 
total number of discoveries, rather than over the total number of pixels analyzed. 

Let Pi , . . . , Pjv denote the p- values ordered from low to high for the N pixels, where p- value is 
defined as 

1 /■°° 

Vvaiuf^ = /H- / Gx^\-{w-Wjf j^af^dw, (27) 

with Wj the average of Wj {k, I) for the scale j over all the pixels. Define 

dj = max I fc,- : Pfc, < ^ | , (28) 

then all the Wj{k, I) with their values larger than dj are classified as 'significant'. Here Cjv = 1 if all the 
pixels are statistically independent. The meaning of aj is approximately the pre-defined false discovery 
rate at scale j with respect to the total number of detections. The larger the q ; value, the larger the 
fraction of Wj {k, I) defined to be 'significant'. In our analyses, we adopt FDR to find the values of M in 
Eq. (26). In the MRLens program, ao is an adjustable parameter, and aj = ao x 2^ (Starck et al. 2006). 



3.4 Multi-scale Entropy Filtering algorithm 

Given the discussions in the previous subsections, the Multi-scale Entropy restoration method reduces 
to find the reconstructed k/ that minimizes /(«/) defined as 

I{>^f) = " "'"'C''^ +0i2J2~^r.[(WKf)j{kM (29) 
" j=l k,l 

where (t„ is the rms of noise in the original convergence map Kots, J is the number of wavelet scales, W 
is the wavelet transform operator and /i„[(W«;/)j (fc, I)] is the multi-scale entropy defined only for non- 
significant coefficients selected by the FDR method. The /3 parameter is calculated under the restriction 
that the residual should have a standard deviation equal to the rms of noise. The best Kf is then obtained 
by iterative calculations. Full details of the minimization algorithm can be found in Starck et al. (2001). 

It can be seen that the two terms in the right of Eq. (29) are balancing each other. While the first 
term tends to keep the information in the most, the second term has the effect to lower the noise as 
much as possible. 



4 RESULTS 



In this section, we present the results of our analyses. For weak-lensing effects from large-scale struc- 
tures in the universe, we use the publicly available ray-tracing weak-lensing maps provided kindly by 
White & Vale (2004). The specific set of lensing maps we analyze are generated from large-scale N- 
body simulations with cosmological parameters = 0.296, ^\ = 0.704, w = —1.0, h = 0.7, and 
as = 0.93. The box size is 300 Mpc h~^, the number of particles is 512^ with to « 1.7 x 10^° Mq /i"^ 
for each, and the softening length is w 20 kpc h~^. There are totally 16 convergence maps and each 
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has a size of 3 x 3 deg pixelized into 1024 x 1024 pixels. The redshift distribution of source galaxies 
follows cx exp[— (z/zq)"^/^] with zq ~ 2/3. 

For each map, we add in Gaussian noise due to the intrinsic ellipticities of source galaxies with the 
variance given by (e.g., Hamana et al. 2004), 



pix 



2 



1 



92. ' 

pix 



(30) 



where d^\^ is the pixel size of the simulated convergence-K map, and is the rms of the intrinsic 
ellipticites taken to be cr^ = 0.3. The surface number density rig depends on specific observations. Here 
we consider Ug = 30 arcmin^^ which is typical for ground-based observations, and Ug — 100 arcmin^^ 
expected from space observations, respectively. Figure 1 presents one set of convergence maps without 
(left) and with (right) noise. It can be seen very clearly that the noise from intrinsic ellipticities of source 
galaxies dominates the map, and certain post-processing procedures are necessary in order to extract 
weak-lensing signals embedded under noise. Here we compare two such methods, namely, the normal 
smoothing method with a Gaussian smoothing function, and the MRLens treatment, paying particular 
attention to their effects on weak-lensing peak statistics. 




Fig. 1 The convergence maps of 3 x 3 deg . The left panel is the noise-free map, and the 
right panel is the noisy convergence map with Ug — 30 arcmin^'^. 



4.1 Statistical properties of residual noise 

Post-processing procedures can reduce noise effectively. However, certain levels of residual noise in- 
evitably remain. It is thus important to understand the statistical properties of the residual noise so that 
we can quantify their effects on weak-lensing cosmological studies properly. For that, we first in this 
subsection consider pure noise maps without including weak-lensing signals. After applying Gaussian 
smoothing and MRLens, respectively, we compare the residual noise-peak statistics in the two cases. 
This is highly relevant to cosmological applications of weak-lensing cluster statistics, in which, high 
peaks in convergence maps are thought to be related to clusters of galaxies and their abundances con- 
tain important cosmological information. The existence of residual noise can generate false peaks in 
convergence maps, which in turn can contaminate the weak-lensing peak statistics significantly. 

With Eq. (30), we generate a 3 x 3 deg^ noise map containing 1024 x 1024 pixels with 9pix = 
0.176 arcmin and the corresponding api^ = 0.22 for rig — 30 arcmin^^ and api^ — 0.12 for Ug — 
lOOarcmin^^. For Gaussian smoothing, we take 9q ~ 1 arcmin. For MRLens, we take = 0.01. 
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In a smoothed map, a positive (maximum)/negative (minimum) peak position is located if its value is 
above/below those of its eight neighboring pixels (e.g., Jain & Van Waerbeke 2000; Miyazaki et al. 
2002). 

Figure 2 shows the probabihty distribution function (PDF) of peaks in the residual noise field for 
the two cases, respectively, with the left for the Gaussian smoothing and the right for the MRLens. In 
each panel, the solid and dashed lines correspond to the results with Ug = 30 arcmin"^ and Ug = 
100 arcmin"^, respectively. The bin size is Ak = 0.005. Both the positive and the negative peaks are 
counted in. Two distinctly different distributions are seen. For the Gaussian smoothing case, the peak 
number distribution has a double-peak behavior at k/cto ^ ±1, in good agreement with that expected 
for a Gaussian random field (Bond & Efstathiou 1987; Van Waerbeke 2000). The rms of the residual 
noise in this case is (Tq « 0.016 for ng = 30 arcmin"^ and uo ~ 0.009 for Ug = 100 arcmin"^, 
in excellent agreement with the theoretical value 0.015 for Ug = 30 arcmin^^ and ao « 0.008 for 
Hg = 100 arcmin"^ calculated from Eq. (15). Considering positive peaks that are relevant for weak- 
lensing analyses, the noise peaks with k/cto ~ 1 , rather than with k/uo = 0, have the highest occurrence 
probability. Such a property of noise can cause statistically a positive shift for the peak height of a 
cluster measured from noisy convergence maps. The shift depends on the density profile of the cluster. 
This noise-induced shift can bias the cluster mass estimation from weak-lensing observations. On the 
other hand, it can increase the weak-lensing detectability of clusters, and thus affect the corresponding 
cosmological studies significantly (Fan et al. 2010). 

For the MRLens case, the residual noise after restoration treatment is low with cto w 0.0029 for 
Ug = 30 arcmin"^ and cto ~ 0.0016 for Ug = 100 arcmin"^, much less than those of the Gaussian 
smoothing. However, the noise statistics is highly non-Gaussian, which results significant complications 
in quantifying the noise effect on weak-lensing signals. The number distribution of noise peaks is nar- 
rowly concentrated around k = 0. Thus unlike the Gaussian smoothing, it seems that we do not expect 
a systematic shift due to noise in weak-lensing cluster peak measurement. It should be noted, however, 
in MRLens, the noise filtering involves restoration procedures based on NOISE-MSE of Eq. (29). The 
results depend on the noise properties [the second term in Eq. (29)] as well as on the properties of 
signals we would like to detect [the first term in Eq. (29)]. The higher the original noise is, the larger 
the fraction of the wavelet coefficients that are suppressed. In such a treatment, the signals are changed 
depending on the original noise level and their own properties. Therefore considering the convergence 
peak for a cluster, the results after MRLens restoration in the cases with and without noise are different. 
In this sense, the existence of noise also induces a systematic bias for the peak value of a cluster, though 
for a reason different from and much more comphcated than that of the Gaussian smoothing case. The 
quantitative modeling of such a bias for MRLens needs to be further explored. 

For MRLens, the uq parameter in FDR affects the classification of significant and non-significant 
wavelet coefficients. A smaller ao results a more stringent criteria for the definition of a significant 
wavelet coefficient, and thus stronger suppressions of noise. To test the ao-dependence, we vary its 
value to obtain different restoration results for pure noise maps. In Table 1, the rms of the residual noise 
for different ao and different Ug are shown. With the increase of Ug, the original noise level decreases 
with (rig)"^/^. It is noted that after MRLens treatment, the rms of the residual noise also approximately 
follows (To oz (ng)^^/^. For the ao-dependence, as expected, the residual noise decreases with the 
decrease of ao. However, this dependence is rather weak. Changing ao from 0.1 to 0.01 only decreases 
ao by ~ 20%. 

4.2 Peak statistics in noisy convergence maps 

Now we consider peak statistics of noisy convergence maps. Figure 3 shows the post-processed maps of 
the right panel of Figure 1 with Gaussian smoothing for 9q = 1 arcmin (upper) and with MRLens for 
ao = 0.01 (lower), respectively. The left panels are for Ug = 30 arcmin"^ and the right panels are for 
Hg = 100 arcmin"^. 
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Fig, 2 The probability distribution functions of peaks in pure noise maps. The solid and 

dashed lines are for Ug = 30 arcmin^^ and ng = 100 arcmin^^, respectively. The left panel 
is for the Gaussian smoothing with = 1 arcmin, and the right panel is for the MRLens 
result with ao = 0.01. 



Table 1 Standard deviation of the reconstruction error with MRLens 
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Comparing to the maps in Figure 1, we see that the post-processing procedures can indeed filter 
out much of the noise so that the real structures in the large-scale mass distribution can be detected. 
For Ug = 30 arcmin"^, the MRLens map looks very smooth with only very massive structures left. On 
the other hand, in the Gaussian smoothing case, small structures can also be seen. However, it contains 
many more noise peaks than that of the MRLens case. For ng = 100 arcmin"^, the map is smoother for 
the Gaussian smoothing case than that with Ug — 30 arcmin^^. The MRLens map, however, appears 
lumpier for the lower noise case. Such opposite trends seen in the Gaussian smoothing and in MRLens 
reflect clearly the different underlying filtering mechanisms between the two smoothing schemes. For 
the Gaussian smoothing, the filtering is mainly performed through an averaging procedure. Given a 
smoothing scale, the peak signals of real clusters are more or less similar regardless the noise level. 
Meanwhile, the noise peaks with relatively high k values are significantly reduced if the noise level is 
lowered. Thus the smoother appearance of the upper right panel is mainly due to the less number of 
high noise peaks than that of the upper left panel. For MRLens, it involves a restoration procedure that 
depends on the original noise level. The smaller the original noise is, the lower the fraction is for the 
wavelet coefficients to be suppressed. It is important to note that the suppression leads to the removal 
of both noise peaks and true peaks of relatively low amplitudes. Thus the lumpier structures seen in the 
lower right panel is largely attributed to the lower level of removal of real structures than that of the 
lower left panel. 

In Figure 4 and Figure 5, we show the probability distribution function of peaks for Gaussian 
smoothing and for MRLens, respectively. The results for each case are obtained by averaging over 
16 simulated maps with noise added. 
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Fig. 3 Noisy convergence maps of 3 x 3 deg^. The upper and lower panels are for the 
Gaussian smoothing with — I arcmin and MRLens with uq — 0.01, respectively. The 
left and right panels are for Ug — 30 arcmin^^ and Ug = 100 arcmin^^, respectively. 



For the Gaussian smoothing results in Figure 4, the black, red dashed, red solid, blue dashed, and 
blue solid lines are for the results of noise free peaks, pure noise peaks with Ug — 30 arcmin^^, noisy 
convergence peaks with Ug ~ 30 arcmin^^, pure noise peaks with ?ig = 100 arcmin^'^, noisy conver- 
gence peaks with rig = 100 arcmin"^, respectively. We can see that in the Gaussian smoothing cases, 
the noise peaks dominate over the real peaks at k < 3(To. At larger k > 3ao, real peaks can be detected 
with high efficiencies. Comparing the blue solid line with the red solid line, we see that by reducing 
the noise level from cto ^ 0.015 to (Tq ~ 0.008, we effectively reduce the number of noise peaks with 
K > 0.025, and thus increase the real peak detection efficiencies significantly. 

In Figure 5 for the MRLens results, the line styles are the same as those in Figure 4. Different 
from that in the Gaussian smoothing cases, here the noise peaks (red and blue dashed lines) contribute 
little to the total number of peaks with k > 0.02 in comparison with the real peaks (black solid line). 
However, the suppression process in the MRLens treatment mistakenly removes a large number of real 
peaks with k < 0.1. Thus we expect a high efficiency but a low completeness in weak-lensing peak 
detections after MRLens filtering. Reducing the original noise level by increasing Ug form 30 arcmin^^ 
to 100 arcmin"^ leads to a less suppression effect. Therefore more peaks with n < 0.1 are kept and the 
completeness of peak detections increases considerably. 

In the next subsection, we investigate and compare explicitly the efficiency and completeness of 
weak-lensing cluster detections in the two smoothing treatments. 




Fig. 4 The probability distribution function of convergence peaks for the case of Gaussian 
smoothing with 9g = I arcmin. The black solid line is for the result of the noise-free con- 
vergence peaks, the red dashed and red solid lines are for the pure noise peaks and noisy 
convergence peaks with Ug = 30 arcmin^^, respectively, and the blue dashed and blue solid 
lines are for the pure noise peaks and noisy convergence peaks with Ug — 100 arcmin^^, 
respectively. The left upper panel includes both maximum and minimum peaks, and the left 
lower panel shows the distribution function for maximum peaks only. The right panel is the 
zoom-in version of the left lower panel. 




Fig. 5 Same as Figure 4 but for the case of MRLens with aQ = 0.01. 



4.3 Efficiency and completeness of weak-lensing cluster detection 

The existence of noise from intrinsic ellipticities of source galaxies results false peaks in convergence 
maps, and thus lowers considerably the efficiency of weak-lensing cluster detections. Increasing the 
detection threshold can increase the efficiency, however at the expense of completeness. In this sec- 
tion, we compare the weak-lensing cluster detection with Gaussian smoothing and with the MRLens, 
respectively. Following Hamana et al. (2004), we define the efficiency fe and completeness /c of clus- 
ter detection with respect to the number of clusters (dark matter halos) above a certain mass threshold. 
Specifically, we have 




Fig. 6 The efficiency /e and completeness fc as functions of the peak detection threshold k. 
The left and right panels are for Ug = 30 arcmin^^ and Ug = 100 arcmin^^, respectively. The 
upper two panels are for the Gaussian smoothing and the lower two panels are for MRLens. 
The lines with symbols are for the completeness, and the lines without symbols are for the 
efficiency. The red, green and blue lines are for the results of halos with A/ > 5 x lO^^M©, 
M > 1 X and M >2x lO^^M©, respectively. 



where Ni denotes the number of convergence peaks with their heights above a detection threshold, Na 
represents the number of dark matter halo with mass above a certain mass threshold, and Nm is the 
number of peaks that have correspondences with dark matter halos among Nu. A peak is defined to be 
associated with its nearest dark matter halo if the location of the peak is within a radius of 12 pixels 
(corresponds to 2.11 arcmin) around the halo. If there are two or more peaks associated with a same 
halo, the highest peak is defined to have the correspondence with the halo. 

Figure 6 shows the results of /e and fc for Gaussian smoothing (upper panels) and MRLens (lower 
panels). The left panels are for rig = 30 arcmin^^, and the right panels are for ?ig — 100 arcmin^^. In 
each panel, the red, green and blue lines are for halos with mass A/ > 5 x 10^^ Mq, AI > Ix 
and M > 2x respectively. The lines with and without symbols are, respectively, for the results 

of completeness and efficiency. The horizontal axis in each panel is the peak detection threshold k. 

We first analyze the Gaussian smoothing cases. As we discuss previously, such a smoothing process 
reserves more or less all the real peaks with scales above the smoothing scale. At mean time, the number 
of noise peaks is large at k < Sctq. Thus a high completeness and a low efficiency are expected when 
the peak detection threshold is low. For Ug = 30 arcmin^^ (upper left), we have co ^ 0.015. At the 
detection threshold k — 0.02 ^ 1.3(To, we have the completeness fc ^ 50%, 65% and 70% for M > 
5 X 10^^ Mq, 1 X 10^^ Mq and 2 x 10^* M©, respectively. The corresponding efficiencies are 30%, 10% 
and 2%. When the detection threshold k > 3cro, the number of noise peaks drops significantly, leading 
to a large increase in the detection efficiency. On the other hand, a considerable fraction of halos are 
missed due to the high detection threshold, resulting a decrease in the completeness. Specifically, at 
K = 0.045 - 3(70, the completeness fc ~ 20%, 35% and 60%, and the efficiency fc - 50%, 25% and 
10%, for A/ > 5 X 10^3 Mq, 1 x lO" M© and 2 x lO^'' M©, respectively. With the increase of Ug to 
Ug = 100 arcmin^^ (upper right), the noise level ctq decreases by a factor of ■\/l00/30 to do ^ 0.008. 
Thus 3(To corresponds to k ^ 0.025. At this detection threshold, the number of noise peaks is smaller 
and correspondingly the efficiency is higher than those with Ug = 30 arcmin"^. On the other hand, the 
number of real peaks does not change much as the noise level decreases, and thus the completeness 
is similar to that of Ug = 30arcmin^^. Quantitatively, at the threshold k = 0.025, the efficiency 
fe ^ 50%, 20% and 8%, in comparison with fc ~ 35%, 12% and 3% in the case of Ug — 30 arcmin"^, 
for AT > 5 X 10^"^ Mq, 1 x 10^* M© and 2 x 10^"* M©, respectively. For the completeness, we have 
fc 30%, 50% and 80% for Ug = 100 arcmin-^. For Ug = 30 ai'cmin^^^ fc - 45%, 60% and 70%. 
While being similar, fc decreases somewhat for A/ > 5 x 10^^ M© and 1 x lO^"' M© with the decrease 
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of noise level. This is in accordance with the analyses of Fan et al. (2010) where they find that the 
existence of noise generates a systematic shift for the real peaks toward higher amplitudes. The shift 
depends on the density profile of dark matter halos associated with the real peaks, and can be as high 
as ~ l(To for NFW halos with low concentrations. In terms of k values, the shift is larger for larger (Tq- 
Thus, in the case of ng = 30 arcmin^^, the relatively large ctq leads to a large shift of the real peak 
heights and consequently a larger number of real peaks above the detection threshold than that in the 
case of Ug = 100 arcmin"^. 

For MRLens, with Hg = 30 arcmin"^ (lower left), the completeness of the weak-lensing cluster 
detection is very low, and fc ^ 10%, 20% and 40% at the threshold k = 0.02, in comparison with 
fc ~ 50%, 65% and 70% in the corresponding Gaussian smoothing case. This is because the suppression 
of the wavelet coefficients aiming to reduce noise removes a large fraction of real peaks in the range 
of K < 0.1 as seen from Figure 5. The total number of peaks in 3 x 3 deg^ with k > 0.02 is only 
~ 53, while the total number of halos in the area with M > 5 x 10^"^ Mq is ~ 530. Thus although 
the efficiency in MRLens here is rather high (~ 80% for M > 5 x 10^^ Mq), the very few number 
of detected halos makes the MRLens method be disadvantageous in comparing with that of simple 
Gaussian smoothing method. For ng = 100 arcmin"^, the noise level is lower and thus the removal 
effect is less significant than the case of Ug = 30 arcmin^^. Consequently, the completeness increases 
considerably with fc ~ 20%, 40% and 65%. Meanwhile, the efficiency decreases somewhat. In this low 
noise case, the differences between the MRLens and Gaussian smoothing in terms of the completeness 
and efficiency are less than those of high noise case. But still, the completeness is lower for MRLens, 
especially considering relatively low mass halos with M > 5 x 10^'^ Mq. 

To further demonstrate the differences between the Gaussian smoothing and the MRLens, in 
Figure 7, we show the peak-halo correspondences explicitly in z — M plane for one of our 3x3 deg^ 
simulation maps, where M is the halo mass in unit of IO^^Mq and z is the halo redshift from simula- 
tions. The halos are the ones located in the solid angle of 3 x 3 deg^ in the considered direction and their 
redshift and mass are taken directly from the halo catalogs constructed by White and Vale (2004). The 
upper and lower panels are for the Gaussian smoothing and the MRLens, respectively. The left and right 
panels correspond to ng = 30 arcmin"^ and ng = 100 arcmin"^, respectively. In each panel, the '+' 
symbols denote the dark matter halos identified in simulations with M > 5x 10^^ Mq and in the redshift 
range of < z < 2. There are very few halos extending to redshift beyond z = 2. The green squares 
represent those halos that have corresponding convergence peaks with k > 0.02. The differences be- 
tween the two filtering methods are strikingly seen. For MRLens with ng = 30 arcmin"^ (lower left), 
a majority of halos with M < 10^^ Mq or with z > 0.8 are missed in weak-lensing detections, con- 
sistent with its extremely low completeness shown in Figure 6. Lowering the noise level by increasing 
ng to ng = 100 arcmin"^ increases the number of halos with associated peaks by nearly a factor of 
2 (lower right). But the number is still much less than that in the Gaussian smoothing case. Therefore 
in studies aiming to detect a large number of clusters from blind surveys and subsequent cosmological 
applications, the Gaussian smoothing method is clearly much better than the MRLens. In addition, the 
noise field after a Gaussian smoothing with ~ 1 arcmin is approximately Gaussian in statistics, and 
thus its effects on weak-lensing cluster detection can be modeled much easier than the case of MRLens 
where the left-over noise is statistically highly non-Gaussian (Fan et al. 2010). 

In MRLens, the ao parameter plays a crucial role in classifying significant and non- significant 
wavelet coefficients. A larger ao leads to a larger fraction of significant coefficients, and thus a less 
suppression effect in MRLens restoration. To see if the problem of low completeness in MRLens cluster 
detection can be largely improved by increasing ao, we analyze the ao dependence for the completeness 
fc as well as for the efficiency fc- The results are shown in Figure 8. The upper and lower panels 
are for ng = 30 arcmin^^ and Ug = 100 arcmin^^, respectively. The red, green and blue lines are 
for M > 5 X lO^^Afg, 1 x lO^M©, and 2 x lO^M©, respectively. The peak detection threshold 
is set to be K = 0.02. We can see that both fc and fe are not very sensitive to ao- Increasing ao 
from 0.01 to 0.1 improves the completeness only by A/c ~ 10% for both ng = 30 arcmin"^, and 
ng = 100 arcmin"^. Meanwhile, the efficiency decreases by A/e 10% — 20%. Therefore increasing 
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Fig. 7 Peak-halo correspondences in z — M plane. The upper and lower panels are for 
the Gaussian smoothing and MRLens, respectively. The left and right panels are for Ug = 
30 arcmin"^ and Ug = 100 arcmin"^, respectively. In each panel, the '+' symbols show all 
the halos with A/ > 5 x 10^^ Mq and the squares show the halos with corresponding conver- 
gence peaks with k > 0.02. 



ao cannot overcome the shortcoming of MRLens considerably. Comparing to the Gaussian smoothing 
cases, the completeness is still low for MRLens weak-lensing cluster detection even with ao — 0.1. 

We then conclude that in probing cosmologies with weak-lensing cluster abundance analyses, in 
which a large sample of clusters is needed, the Gaussian smoothing method performs much better than 
the MRLens method. To overcome the relatively low efficiency for low peaks in the Gaussian smoothing 
treatment, a detection threshold k > 3(To is normally set. In Fan et al. (2010), the noise effects on 
convergence peak statistics can be accurately modeled for the Gaussian smoothing method. Therefore it 
is potentially possible to even include peaks with k < 3cto in the abundance analyses, which can increase 
the number of detected clusters greatly so that to strengthen the derived cosmological constraints on 
different parameters. This will be explored further in our future studies. 



5 SUMMARY 

Constructing cluster samples through their weak-lensing effects has been an important aspect of weak- 
lensing studies. Their statistical abundance contains valuable cosmological information. Observations 
have shown the feasibility in detecting clusters with weak-lensing effects (e.g., Wittman et al. 2006; 
Dietrich et al. 2007; Gavazzi & Soucail 2007; Schirmer et al. 2007; Hamana et al. 2009). In conjunction 
with optical observations, the detailed analyses on the completeness and efficiency of weak-lensing se- 
lected cluster samples also become possible (e.g., Geller et al. 2010). It is noted, however, the efficiency 
and completeness depend on the method applied to reconstruct the convergence field from shear mea- 
surements. Different methods can result residual noise with different statistical properties, and can also 
change the weak-lensing signals differently. In order to extract cosmological information from observa- 
tions, it is therefore crucial to understand how a particular reconstruction method affects the results in 
detail. 
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Fig. 8 The ao dependence of the completeness fc and the efficiency /e. The upper and lower 
panels are for rig = 30 arcmin^'^ and Ug = 100 arcmin"^ respectively. The peak detection 
threshold is set to be k = 0.02. The red, green and blue lines are for the results of halos with 
M > 5 X IQi^M©, M > 1 X IO^^Mq, and M > 2 x IO^'^Mq, respectively. The Hnes with 
and without symbols are for the completeness fc and the efficiency /e, respectively. 



In this paper, we systematically compare the Gaussian smoothing method and the MRLens treat- 
ment to suppress noise from intrinsic ellipticities in convergence maps. We concentrate on convergence 
peak statistics. It is found that while the MRLens method can remove noise very effectively, it mistak- 
enly removes a large fraction of real peaks associated with clusters of galaxies. For Ug = 30 arcmin^^, 
the number of peaks with k > 0.02 after MRLens filtering is only ^ 50 in an area of 3 x 3 deg^ 
in comparison with ^ 530 for the number of halos of A/ > 5 x 10^'^ M©. On the other hand, for the 
Gaussian smoothing treatment, the number of detected clusters is ~ 260. Even with the detection thresh- 
old K = 3<Jq ^ 0.045, which is normally set in the Gaussian smoothing treatment to reduce the number 
of noise peaks in the peak catalog and thus to increase the cluster detection efficiency, the number of 
detected clusters is ^ 100, twice as many as that in the MRLens filtering with the threshold k — 0.02. 
As the accuracy of statistical abundance analyses depends crucially on the number of detected clusters, 
the Gaussian smoothing method is therefore strongly favored to detect clusters as many as possible. 
Furthermore, the Gaussian smoothing leads to a noise field which is approximately Gaussian in statis- 
tics, while the residual noise from MRLens filtering is highly non-Gaussian. Therefore the noise effects 
can be modeled more straightforwardly for the Gaussian smoothing case than that of MRLens (e.g., van 
Waerbeke 2000; Fan 2007; Fan et al. 2010). The recent studies of Fan et al. (2010) on the weak-lensing 
peak statistics with noise included provide an analytical model for the efficiency of peak detections in 
the Gaussian smoothing case. Thus it is possible for us to include peaks with k < 3(To in the analyses. 
Then the number of detected clusters can increase considerably, which in turn can lead to a significant 
improvement in the cosmological constraints derived from weak-lensing cluster statistics. 
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